BIOTECHNOLOGY #fl ?-0 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 
Source: 

Date Processed by STIC: f/S^/^°^ 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: Datin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help(Sjuspto.£Ov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM. ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25 . 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://www.uspto.gov/web/offices/pac/checker 



t 



Raw Sequence Listing Error Summary 



ERROR DETECTED 

ATTN: NEW RULES CASES: 

1 Wrapped Nucleic* 

Wrapped Aminos 



SUGGESTED CORRECTION - SERIAL NUMBER: 

PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The numberAext at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right, margin to .3; this will 
prevent "wrapping." 



Invalid Line Length The rules require that a line not exceed 72 characters in length, this includes white spaces. 



Misaligned Amino 
Numbering 

Non- ASCII 



_ Variable Length 



Patenlln 2.0 
"bug" 



7 Skipped Sequences 

(OLD RULES) 



The numbering under each 3* amino acid is misaligned. Do not use tab codes between numbers; 
use apace characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission b saved In ASCII text. 

" •" • " • • \ 

Sequences) contain n's or Xaa's representing more than, one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused Aft <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would" automatically, generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequence^) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTIONS EQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



8 Skipped Sequences ' Sequences) _ 



missing. If intentional, please insert the following lines for each skipped sequence. 



10 



(NEW RULES) <210> sequence id number 

<400> sequence id number 
000 

_Use of n's or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listing. 

(NEW RULES) Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

_Invalid <213> Per 1.823 of Sequence Rules, the only valid <213> responses arc: Unknown, Artificial Sequence, or 

Response scientific name (Genus/species). <220>-<223> section is required when <2 13> response is Unknown or 

is Artificial Sequence 

1 1 Use 0 f <220> Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses. • 

Use of <220> to <223> is MANDATORY if <2 13> "Organism" response is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register " 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

1 2 Patentln 2.0 Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 

"bug" resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 

listing). Inst cad, please use "File Manager" or any other manual means to copy file to floppy disk. . 



13 Misuse of n " n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 
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PCT09 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/673,918 



DATE: 09/05/2001 
TIME: 14:37:47 



Input Set : A:\16430seq.txt 

Output Set: N:\CRF3\09052001\l673918.raw 



<110>" 



<120> 



3 
4 
5 
6 
8 
9 

11 <130> 

13 <140> 

14 <141> 

16 <150> 

17 <151> 

19 <150> 

20 <151> 
22 <160> 

<170> 



Does Not Comply 
Corrected Diskette Needed 



24 



APPLICANT: Xia, Zhi-Qiang 
Costa, 'Michael A 
Davin, Laurence B 
Lewis, Norman G 

TITLE OF INVENTION : Recombinant Secoisolariciresinol Dehydrogenase, 
Methods of Use 

FILE REFERENCE: WSUR116430 . 
CURRENT APPLICATION NUMBER: 09/673,918 * J 

CURRENT FILING DATE: 2001-05-30 

PRIOR APPLICATION NUMBER: PCT/US99/08 975 
PRIOR FILING DATE: 1999-04-23 
PRIOR APPLICATION NUMBER: 60/082,977 
PRIOR FILING DATE: 1998-04-24 
NUMBER OF SEQ ID NOS : 25 
SOFTWARE: Patent In Ver. 2.0 



and 



4f 



x 



ERRORED SEQUENCES 



395 


<210> SEQ ID NO: 


: 6 
























396 


<211> LENGTH: 273 
























397 


<212> TYPE: 


PRT 


























398 


<213> ORGANISM: 


Forsythia x 


intermedia 














400 


<4 00> SEQUENCE: 


6 
























401 


Met 


Gin 


Leu 


Arg 


Thr 


Ala 


He 


Ala 


Arg" 


Arg 


Leu 


Glu 


Gly 


Lys 


Val 


Ala 


402 


1 








5 










10 










15 




404 


Leu 


He 


Thr 


Gly 


Gly 


Ala 


Ser 


Gly 


Val 


Gly 


Glu 


Val 


Thr 


Ala 


Lys 


Leu 


405 








20 










25 










30 






407 


Phe 


Ser 


Gin 


His 


Gly 


Ala 


Lys 


Val 


Ala 


He 


Ala 


Asp 


Val 


Gin 


Asp 


Glu 


408 






35 










40 










45 








410 


Leu 


Gly 


His 


Ser 


Val 


Val 


Glu 


Ala 


He 


Gly 


Pro 


Ser 


Asn 


Ser 


Thr 


Tyr 


411 




50 










55 










60 










413 


He 


His 


Cys 


Asp 


Val 


Thr 


Asn 


Glu 


Asp 


Gly 


Val 


Lys 


Asn 


Ala 


Val 


Asp 


414 


65 










70 










75 










80 


416 


Asn 


Thr 


Val 


Ser 


Thr 


Tyr 


Gly 


Lys 


Leu 


Asp 


He 


Met 


Phe 


Asn 


Asn 


Ala 


417 










85 










90 










95 




419 


Gly 


He 


Ser 


Asp 


Pro 


Tyr 


Lys 


Pro 


Arg 


Val 


He 


Asp 


Asn 


Glu 


Lys 


Ala 


420 








100 










105 










110 






E— > 422 


Asp 


Phe 


Glu 


Arg 


Val 


Leu 


Ser 




Asnl 




)Thr 


Gly 


Val 


Phe 


Leu 


Phe 


423 






115 














© 






125 








425 


Met 


Lys 


His 


Ala 


Ala 


Arg 


He 


Met 


Val 


Pro 


Ala 


Arg 


Asn 


Gly 


Cys 


■He 


426 




130 










135 










140 










428 


He 


Ser 


Thr 


Ala 


Ser 


Leu 


Ser 


Ser 


Thr 


Met 


Gly Gly 


Gly 


Ser 


Ser 


His 


429 


145 










150 










155 










160 


431 


Ala 


Tyr 


Cys 


Gly 


Ala 


Lys 


His 


Ala 


Val 


Leu 


Gly 


Leu 


Thr 


Arg 


Asn 


Leu 


432 










165 










170 










175 




434 


Ala 


Val 


Glu 


Leu 


Gly 


Gin 


Phe 


Gly 


He 


Arg 


Val 


Asn 


Cys 


Leu 


Ser 


Pro 



^^^^^ 

jju 



file://C:\Crf3\Outhold\VsrI6739 1 8.htm 



9/5/01 



Page 2 of 4 



RAW SEQUENCE LISTING DATE: 09/05/2001 

PATENT APPLICATION: US/09/673,918 TIME: 14:37:47 

Input Set : A:\16430seq.txt 

Output Set: N:\CRF3\09052001\l673918.raw 



435 








180 










185 










190 






437 


Phe 


Gly 


Leu 


Pro 


Thr 


Pro 


Leu 


Ala 


Lys 


Lys 


Phe 


Ser 


Gly 


He 


Glu 


Asn 


438 






195 










200 










205 








440 


Asp 


Val 


Asp 


Phe 


Ala 


Asn 


Ala 


He 


Glu 


His 


Ala 


Gly Asn 


Leu 


Lys 


Gly 


441 




210 










215 










220 










443 


Thr 


Lys 


Leu Arg 


He 


Glu 


Asp 


Val 


Ala 


Asn 


Ala 


Ala 


Leu 


Phe 


Leu 


Ala 


444 


225 










230 










235 










240 


446 


Ser 


Asp 


Glu 


Ala 


Gin 


Tyr 


Val 


Ser 


Gly 


Gin 


Asn 


Leu 


Phe 


He 


Asp 


Gly 


447 










245 










250 










255 




449 


Gly 


Phe 


Ser 


Val 


Cys 


Asn 


Ser 


Ala 


He 


Lys 


Met 


Phe 


Gin 


Tyr 


Pro 


Asp 


450 








260 










265 










270 






452 


Ser 

































C/J) ^ Use of n and/or Xaa has been detected in the Sequence Listing. 
Y Review the Sequence Listing to insure a corresponding 

" explanation is presented in the <220> to <223> fields of 
each sequence using n or Xaa. 
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VERIFICATION SUMMARY DATE: 09/0.5/2001 

PATENT APPLICATION: . US/09/673 , 918 TIME: 14:37:48 



Input Set : A:\16430seq.txt 

Output Set: N:\CRF3\09052001\I673918.raw 



L: 


14 


271 




Current Filing Date differs, Replaced Current Filing Date 


L: 


351 


M 


:258 


W 


: Mandatory Feature 


missing, <223> not found for SEQ ID#:5 


L: 


351 


M 


:341 


W 


: (46) 


"n" or 


f, Xaa" 


used, 


for SEQ ID#:5 


L: 


352 


M 


:258 


w 


: Mandatory Feature 


missing, <223> not found for SEQ ID#:5 


L: 


352 


M 


:341 


w 


: (46) 


"n" or 


"Xaa" 


used, 


for SEQ ID#:5 


L: 


422 


M 


:340 


E 


: (46) 


"n" or 


"Xaa" 


used: 


Feature required, for SEQ ID#:6 


L: 


758 


M 


:341 


W 


: (46) 


"n" or 


"Xaa" 


used, 


for SEQ ID#:11 


L: 


761 


M 


:341 


W 


: (46) 


"n" or 


"Xaa" 


used, 


for SEQ ID#:11 


L: 


804 


M 


:341 


W 


: (46) 


"n" or 


"Xaa" 


used, 


for SEQ ID#:14 


L: 


823 


M 


:341 


W 


: (46) 


"n" or 


"Xaa" 


used, 


for SEQ ID#:15 


L: 


842 


M 


:341 


W 


: (46) 


"n" or 


"Xaa" 


used, 


for SEQ ID#:16 


L: 


879 


M 


:341 


W 


: (46) 


"n" or 


"Xaa" 


used, 


for SEQ ID#:18 
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